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Abstract 

We analyze time series data of the fluctuations of slow solar wind velocity using rank order 
statistics. We selected a total of 18 datasets measured by the Helios spacecraft at a distance 
of 0.32 AU from the sun in the inner heliosphere. The datasets correspond to the years 
1975 — 1982 and cover the end of the solar activity cycle 20 to the middle of the activity cycle 
21. We first apply rank order statistics to time series from known nonlinear systems and then 
extend the analysis to the solar wind data. We find that the underlying dynamics governing 
the solar wind velocity remains almost unchanged during an activity cycle. However, during 
a solar activity cycle the fluctuations in the slow solar wind time series increase just before 
the maximum of the activity cycle. 
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1. Introduction 

Solar wind is a stream of charged par- 
ticles, mostly protons and electrons, that 
are ejecte d from t he up per atmosphere of 
the sun. IParkerl ( 19581 ). who first coined 
the term 'solar wind', showed that the so- 
lar wind is in fact the outer coronal atmo- 
sphere escaping supersonically into the in- 
terstellar space. The solar wind is primar- 
ily divided into two components, a fast wind 
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with velocities ~900 km/s and a slow wind 
with velocities ~300 km/s. These two com- 
ponents differ in their origin and charac- 
teristics which include flow speed, proton 
density, tempe r ature , and helium content 



( IFeldman et all . 120051 : ISchwennl . 120071 ). The 
fast solar wind originates from coronal holes, 
where the magnetic field lines are open, while 
slow wind is associated with open magnetic 
field and is highly st r uctur e d and highly vari 



able ( iKrieger et al.l . Il973t IWoo and Martin 



19970 . 



Observations of solar wind velocity dis- 
tributions made by different spacecrafts 
have been analyzed in considerable detail. 
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Helios spacecraft probed the heliosphere 
as close as 0.3 AU from the sun and these 
observations have been analyzed using 



tools ( 


Vfacek and Oboiska. 


1997. 


1998; 


Macek. 


1998: Macek and Redaelli. 


2000; 


Redaelli and MacekL 2001 




Gupta et al.. 


2008h. 


Macek and Obojskg 


( 


199 7|) argued 



that the inner heliosph ere is a low dimen- 
sional chaotic attractor. iMacek and Obojska 
( 119980 studied a single time series using 
moving average, singular-value decom- 
position and surrogate data analysis to 
characterize the nonlinear behavior of the 



data. IMacek and Redaelli! ( 120001 ) have shown 
that the Kolmogorov entropy is positive and 
finite, as it holds f or a chaotic system. 
Redaelli and Macekl (120011 ) extended the 
study by using a nonlinear filter to obtain 
the noise-reduced data and more reliable 
estimate of largest Lya punov exponent and 



Kolmogorov entropy. iGupta et al.l ( 120081 ) 
analyzed several Helios time series to under- 
stand the local dynamics of fluctuations in 
the slow solar wind velocity. They suggest 
the inherent changes in the dynamics over 
the solar cycle using the variance in largest 
Lyapunov exponent for each time series. 



In this work, we study how the dy- 
namical system governing the slow solar 
wind velocity behaves at different phases 
of the solar activity cycle. We apply the 
method of rank order statis tics of symbolic 
sequence (lYang et all 120031 ) to investigate 



the solar wind time series data. In the next 
Section, we describe the datasets and their 
pre-processing. In Section [3] we describe the 



Table 1: The solar wind velocity data measured 
by the Helios spacecraft in the years 1975 to 1981 
with Carrington rotation(CR); Initial time (Tj); Fi- 
nal time(Ty); Data points(TV). Sunspot numbers are 
from htt p : / /side . oma. be / sunspot-dat a/| 



S.No. 


CR 


Year 


7j 


Tf 


N 


Sunspot 








(day:h:min) 


(day:h:min) 




number 


1 


1625 


1975 


069:21:36 


071:17:42 


3515 


18 


2 


1625 


1975 


077:00:45 


078:20:52 


2938 


22 


3 


1632 


1975 


260:00:35 


261:20:39 


2979 


14 


4 


1632 


1975 


267:03:52 


268:23:56 


3083 





5 


1639 


1976 


085:04:28 


087:00:45 


2245 


38 


6 


1639 


1976 


092:07:20 


094:03:10 


1843 


23 


7 


1646 


1976 


282:11:52 


284:06:39 


1188 


7 


8 


1653 


1977 


099:11:25 


101:07:15 


2741 





9 


1653 


1977 


106:13:51 


108:09:57 


3376 


31 


10 


1660 


1977 


296:17:34 


298:13:48 


1085 


30 


11 


1667 


1978 


114:19:36 


116:14:33 


2393 


106 


12 


1667 


1978 


121:21:18 


123:17:31 


1911 


89 


13 


1681 


1979 


137:05:09 


139:01:23 


1894 


148 


14 


1695 


1980 


145:10:38 


147:07:04 


2218 


229 


15 


1695 


1980 


152:12:36 


154:09:02 


2559 


152 


16 


1709 


1981 


159:17:43 


161:14:02 


3174 


58 


17 


1709 


1981 


167:03:16 


168:16:26 


2562 


119 


18 


1723 


1982 


175:00:30 


176:21:15 


1361 


112 



rank order statistics of symbolic sequence 
technique. In Section 0] we apply symbolic 
analysis technique to time series obtained 
from known nonlinear dynamical systems 
like Lorenz, Rossler, and Chua oscillators 
and extend the analysis to solar wind data. 
In the last Section we give conclusions of our 
study. 



2. Helios data and noise reduction 

We analyze time series data of radial 
velocity (v) measured by the Helios space- 
craft at 0.32 AU in the years 1975 to 
1981, corresponding to parts of the so- 
lar activity cycles 20 and 21 (June, 1976 to 
September, 1986). This data is obtained from 
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Time step Time step 



Figure 1: Pre-processing of an example data set (time series number 16 in table [T]) (a) the original data, 
(b) the detrended data, (c) the filtered data after nonlinear noise reduction, and (d) the noise removed by 
nonlinear filter. 



(IMacek and Obojskal . 119971 ; I Gupta et al. 



2008 



|http: / /sprg.ssl.berkeley.edu/impact/data_bro w»d^eiiAditLgil a fraction of the total sam- 
Although several more time series were avail- pie) were subtracte d from t he raw data 
able, only those were selected that satisfied 
the condition of sufficient (> 1000) data 
points. Thus, 18 time series were extracted 
from the Helios database with the sampling 
time for each time series being 40.5s. The de- 
tails of the time series used in the analysis are 
given in Table 1. In addition, Table 1 (last 
column) contains the corresponding sunspot 
numbers for each time series obtained from 
http:/ /side. oma.be/sunspot-data/. 

In Fig. [T^i , we have shown a solar wind 
time series of 3174 data points recorded 
in the year 1981. The sequence shows a 
general decreasing trend. Therefore, before 
a detailed analysis is carried out, linear 

and quadratic trends (v = at 2 + bt + c, noise level (jSchreiberl . Il993bl ) 



For the data shown in Figfjji, param- 
eters a, b, and c are 0.15, —1.71, and 294.77 
respectively. Fig. [T|d shows the resulting 
detrended time series for the data. 



Solar wind data are believed to be a output 
of a dynamical process upon which the noise 
has been superimposed (jMacek and Obojska . 
19981: IMacek and Redaellil. l2000h. Following 



Macek and Redaellil (120001 ) and iGupta et al. 
fl2008l) we use a nonlinear filter (ISchreiberl . 



1993al ) to clean the signal. This filter aver- 



ages in the embedding space of a chosen di- 
mension 21 + 1 and a defined neighborhood of 
size e, about 2-3 times of the estimated initial 
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Figure 2: (a) Time series (x component) of the Lorenz oscillator, Eq. (5), (b) binary digit corresponding to 
the time series, binary word corresponds to t = 1 is (2 3 .0 + 2 2 .0 + 2 1 .l + 2°.l = 3), (c) probability distribution 
of every 4-bit word. The word index ranges from to 2 L — 1 for L- bit words (complete time series). 



Let Vi,i = 1,2, ...,n be the detrended signal. 
We construct vectors Vi = (vi-i, ...,Vi+i) in 
this 2/ + 1 embedding space. We then replace 
the data point Vi by its mean value in the 
neighborhood of size e, 



- E 



J 3i 



\Vi-Vj\Ke 



Where v\ denote the clean value and iVj is the 
number of neighbors within e, i.e., |Vj — Vj\ < 
e. This process is now iterated taking de- 
creasing e until no neighbors are found. We 
performed two iterations taking the param- 
eters: I = 3, e = 0.12 and I = 6, e = 0.06 
(jSchreiberl . Il993al ). The noise (that is, the 



difference between the detrended data in Fig. 
[Tb and the filtered data in Fig. [Tb) is shown 
in Fig. HH. 



3. Symbolic Analysis 

Time series that originate from dynamical 
systems having complex nonlinear inter- 
ac tions may look v ery much like noise 



(IKantz and Schreiberj . 120041 ) . Dynamical 
system behind such intrinsically noisy 
dataset can be simplified by mapping it to 
the binary sequences, where the increase and 
de crease of fluctuation s are denoted by 1 and 



(IKurths et all Il995h . 



Let Xi,i = 1,2, ...,n be the time series 
data. A part (50 data points) of the time 
series (here a; component of Lorenz oscillator 
(iLorenzl. ll963lV ) : s sh own in Fig. [2b,. Follow- 



ing |Yan£_et_aL| (120031 ). we classify each pair of 
successive data into one of the two states that 
represents a decrease in x, or an increase in x. 
These two states are mapped to the symbols 
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Figure 3: Rank order comparison of two time series from (a) same system (x component of Lorenz oscillator 
at different time), (b) different system (x component of Lorenz oscillator and x component of Rosslcr 
oscillator). 



and 1, respectively: 

S n x n ^ x n —\ 

= 1 x n > x n _i (2) 

We take Si = 0. This binary sequence rep- 
resents a unique pattern of fluctuations in a 
given time series. Fig. [2b shows sequence of 
binary digits for the part of the data shown 
in Fig. [2b,. The resulting symbolic series 
is then partitioned into short sequences of 
given length L. Every short sequence is then 
uniquely identified by a unique word (inte- 
ger), 

L 

w L (k) = J22 L ~ l S> + k-i- (3) 

i=l 

We find the probabilities of different words 
(Fig. [2b) in the complete time series, and 
then sort them according to decreasing fre- 
quency. The resulting rank-frequency distri- 



bution, therefore, represents the statistical 
hierarchy of symbolic words of the time se- 
ries. We compare the rank number of each 
L bit word for two different time series. In 
Fig. [3] we plot the rank number of two time 
series against each other. Fig. [3b shows the 
rank order comparison for time series of x 
component of chaotic solution of Lorenz os- 
cillator at two different time regimes. In Fig. 
[3b rank order comparison for time series of x 
component of Lorenz oscillator and time se- 
ries of x component of Rossler oscillator in 
the chaotic regime is shown. If the two time 
series are similar (from same dynamical sys- 
tem), their rank order points will be located 
near the diagonal line (Fig. [3b). If the two 
time series are from different dynamical sys- 
tems, their rank points will be more scattered 
(Fig. [3b). We define a parameter weighted 
distance for L bit sequence (D L ) between the 



5 



two time series as: 



„ , . Hk = l I R x(™k) ~ Ry( w k) I Px( w k)Py( w k) 
D L {x,y)= -7 ■ (4) 



(2 L - i)E£iP 1 .(«*)j'»K) 



Here x and y are two time series, p(wk) 
and R(wk) represent the probability and the 
rank of a specific word (wk) in the time series 
respectively Value of Dl lies between and 
1. Greater distance indicates less similarity 
between the two time series and vice-versa. 
The distance between same two time series 
is zero. 

4. Results and Discussion 

4-1. Rank order statistical analysis of nonlin- 
ear models 

We take three systems (Lorenz, Rossler, 
and Chua) exhibiting chaotic mo tions. 
Lorenz system is give by (ILorenzl . 119631 ) : 



x = a(y-x), 
y = x(p-z)-y, 
z = xy — (3z. 



(5) 



We choose parameters a = 10, p = 28, and 
= 8/3. 



Rossler system is given by ( jRosslerl . Il976l ) : 



X = -y- z 7 

y = x + ay, 

z = b + z(x — c). 



(6) 



Parameters are fixed at a = 0.1, b = 0.1, and 
c = 18. 



Chua system is given by (IChua et all 119931 ) : 

x = ci(y — x — p(x)), 

y = c 2 (x-y + z), 

z = ~ C 3V, (7) 

where p(x) is defined as, p(x) = mix + 
((mo - mi)(| x+l\-\x-l\))/2. We 
take Ci = 15.6, c 2 = 1, m = —8/7, m 1 =5/7 
and select the parameter C3 = 33 such that 
the motion will be a single-scroll type. 

We take two time series from each system 
at different time intervals. Hence we have 6 
time series in all. They are denoted as L x (t) 
and L x (t ) for the Lorenz oscillator, R x (t), 
R x (t ) for the Rossler oscillator, and C x (t) 
and C x (t ) for the Chua Oscillator. We use 
rank order statistics on these time series data 
by keeping L = 4. Fig. H^i shows distances 
(Dl) for different combinations of these time 
series of length 1000, 5000 and 10000. It is 
clear from Fig. that time series from the 
same system have less distance as compared 
to time series from different systems. Fig. HJi 
also indicates that 1000 data points are suf- 
ficient for this analysis. Similar results are 
obtained when we add noise to the original 
time series. Fig. Hb shows the result when we 
add 5%, 10%, and 20% noise to the original 
time series of 1000 data points. Therefore, 
we can say that this rank order statistics is 
quite robust against the presence of high level 
external noise. 

4-2. Rank order statistical analysis of solar 
wind 

Solar wind data hav e positive largest 



Lyapunov exponent flPavlos et al.l . 11992 
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Figure 4: Distances Dl for different combinations of time series obtained from nonlinear dynamical systems 
(a) for time series of length 1000, 5000, and 10000, (b) with 5%, 10%, and 20% noise added to the time 
series of 1000 data points. 



Macek and Redaellil . 120011 ; iGupta et al.l . 
2008) , positive K olmog or ov entropy 



(IMacek and Redaelli 
fractal dimension 
This leads 



2000, 200 11), and 



(IMacek and Obojskal . 
to the conclusion that 



19971 ) . 

wind dynamics are driven by the complex 
nonlinear interactions. We use rank order 
statistics with L = 4 and time series of 1085 
data points for the analysis. Fig. |5] shows the 
solar activity from March 1975 to September 
1986. It shows monthly averaged sunspot 
index and smoothed monthly sunspot index. 
In the inset daily sunspot index around the 
maximum of solar activity cycle is shown. 
We have shown starting time of our analyzed 
time series on the activity cycle. Time 
series 13, 14, and 15 are located around the 
maximum of solar activity cycle 21. Sunspot 
index corresponding to them are 148, 229, 
and 152 respectively i.e., sunspot index 
increases while going from time series 13 to 
14 and decreases on going from time series 
14 to 15. 



Fig. [6] shows the inter-time series distance 
of solar wind velocity as given by Eq. (|4]). 
Behavior of inter-time series distance remains 
same for all the time series, but the dis- 
tance is maximum with time series 13 (for 
time series 1,2,3,7,9,10,11,12,14,15,18) 
and with time series 14 (for time series 

4, 5, 6, 8, 13, 16, 17), i.e., only two time series 
13 and 14 show least correlation with rest of 
the time series. Fig. [7] shows the most dis- 
tant time series from a particular time series. 
Positions of these time series are around the 
maximum of the solar activity cycle but to- 
wards the increasing phase only. Time series 
15 is also near the solar maximum but it is in 
the decreasing phase. This could be a reason 
for its less distance to the other time series 
as compared to the time series 14. 

5. Conclusions 

We used rank order statistics to analyze 
the time series obtained from known nonlin- 
ear systems. This method is an effective tool 
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Figure 5: Monthly averaged sunspot number (broken line) and smoothed monthly sunspot number (solid 
line) from March 1976 to September 1986. Initial positions of solar wind data sets are shown on the smooth 
curve. In the inset daily sunspot index and position of data sets 14 and 15 are shown. 
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Figure 6: Distances Dl of a time series from others. For example, panel 1 (top left) shows the distance 
between time series 1 and all 18 time series. The distance between same two time series is zero. 



8 




6 8 lO 12 

Serial no. of time series 



Figure 7: Serial number of time series vs. most distant time series serial number. 



for analyzing short time series even in the 
presence of high level of external noise and 
is applied to 18 solar wind time series ob- 
tained by the Helios spacecraft for the years 
1975 — 1982. Nonlinear filter is used to reduce 
the noise present in the data. We found that 
all the time series have least correlation with 
data sets 13 and 14, which are located near 
the maximum of the solar cycle. Although 
data set 15 is also located very close to the 
data set 14 (on the solar activity cycle), it 
shows significant correlation. One possible 
reason for this may be that data set 15 is 
located in the decreasing phase of the solar 
activity cycle, while data sets 13 and 14 are 
located in the increasing phase of the solar 
activity cycle. This suggests the presence of 
larger fluctuations in the solar wind velocity 
data corresponding to the time just before 
the maximum of the activity cycle. 
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